Automatic Text Simplification for German

نویسندگان

چکیده

The article at hand aggregates the work of our group in automatic processing simplified German. We present four parallel (standard/simplified German) corpora compiled and curated by group. report on creation a gold standard sentence alignments from sources for evaluating alignment methods this standard. show that one performs best majority data sources. used two as basis first sentence-based neural machine translation (NMT) approach toward simplification In follow-up work, we extended model to render it capable explicitly operating multiple levels using source-side language level labels improves performance with regard evaluation metrics commonly applied measuring quality text simplification.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building a German/Simple German Parallel Corpus for Automatic Text Simplification

In this paper we report our experiments in creating a parallel corpus using German/Simple German documents from the web. We require parallel data to build a statistical machine translation (SMT) system that translates from German into Simple German. Parallel data for SMT systems needs to be aligned at the sentence level. We applied an existing monolingual sentence alignment algorithm. We show t...

متن کامل

Automatic induction of rules for text simplification

Long and complicated sentences pose various problems to many state-of-the-art natural language technologies. We have been exploring methods to automatically transform such sentences as to make them simpler. These methods involve the use of a rule-based system, driven by the syntax of the text in the domain of interest. Hand-crafting rules for every domain is time-consuming and impractical. This...

متن کامل

Automatic Text Simplification via Synonym Replacement

In this study automatic lexical simplification via synonym replacement in Swedish was investigated using three different strategies for choosing alternative synonyms: based on word frequency, based on word length, and based on level of synonymy. These strategies were evaluated in terms of standardized readability metrics for Swedish, average word length, proportion of long words, and in relatio...

متن کامل

Automatic Text Simplification via Synonym Replacement

Automatic lexical simplification via synonym replacement in Swedish was investigated. Three different methods for choosing alternative synonyms were evaluated: (1) based on word frequency, (2) based on word length, and (3) based on level of synonymy. These three strategies were evaluated in terms of standardized readability metrics for Swedish, average word length, and proportion of long words,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Frontiers in Communication

سال: 2022

ISSN: ['2297-900X']

DOI: https://doi.org/10.3389/fcomm.2022.706718